AUTOMATION IN MUSEUM COLLECTIONS 1 



By Raymond B. Manning 
Smithsonian Institution , Washington , D. C. 

Introduction 

For a little over a year now several of us in the Smithsonian 
have been associated with a project designed to investigate 
possible uses of electronic data processing ( EDP ) for computer 
storage and retrieval of specimen-associated data. The project 
has been funded under a contract with the Office of Education 
of the Department of Health, Education, and Welfare. It is a 
joint effort by members of the staff of the Museum of Natural 
History and Information Systems Center, Smithsonian Institu- 
tion. 

The project is based on the thesis that a museum collection 
is more than an assemblage of inanimate objects or dead orga- 
nisms; it is a vast information resource which we cannot ade- 
quately use with current methods of record keeping. A second 
factor, which is also quite important, is that collections are con- 
tinually growing at a rapid rate. In the Department of Inverte- 
brate Zoology alone, the collections are increasing by at least 
200,000 specimens per year. This trend is hardly likely to change 
in the near future, and if specimen-associated data in the 
collections is too difficult to obtain now, it will be even less 
available in the future. If computerized data record-keeping 
systems are going to be developed and used, the project must 
be started now. Delays will only increase the difficulties and 
the cost. 

The MNH project was designed to set up record-keeping sys- 
tems in three separate areas of the museum: marine rocks, un- 

1 This work was supported by a grant from the Library and Information Sciences 
Research Branch, Office of Education, Department of Health, Education, and Wel- 
fare, OEG-1-071 159-4425. 
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der the supervision of William Melson, Department of Mineral 
Sciences; oceanic birds, under the supervision of George Wat- 
son, Department of Vertebrate Zoology; and marine crus- 
taceans, particularly stomatopods, under the author's supervi- 
sion. Donald F. Squires, then Deputy Director of the Museum, 
was among the first to recognize the need for data processing 
in the museum and it was he who sought and obtained support 
to initiate the pilot project. 

In Crustacea, the project is designed to aid management of 
the collection, to aid curation, and to enhance the collection as 
a research tool. Our primary aim was to update some of our 
techniques of collection management, and to develop tech- 
niques which would allow us to manipulate specimen-associ- 
ated data without having to return to the collection every time 
we work with the data. 

The overall MNH project has been divided into two phases: 
( 1 ) to build a data bank based primarily, but not exclusively, 
on three separate collections, and (2) to manipulate the data 
in various ways to evaluate the overall costs of not only various 
portions of the project but also the costs of general handling 
and processing of museum collections, regardless of the ulti- 
mate use and method of storage and retrieval of the data. 

The first IS months of the contract, ending in December, 
1968, have been concerned with entering the data, building up 
the base, and solving the innumerable problems that arose at 
every step. The next phase will deal principally with interroga- 
tion of the data base. 

My remarks are designed to give a progress report on activi- 
ties in Crustacea, an idea of some of the different problems 
which we have encountered and some of the results of the proj- 
ect. 

We are not alone in the scientific community in our interest 
in developing a system for storage and retrieval of specimen 
data. The British Museum (Natural History) and the National 
Museum of Canada are both working on developing such a 
system, and system development is being considered by over 
30 museums around the world. More than 70 representatives 
of universities, museums, zoological parks, and botanical gar- 
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dens met in Mexico City, in 1967 to discuss “Information Prob- 
lems in the Natural Sciences.” Bullis and Roe (1967), report- 
ing on a bionumeric code used by the Bureau of Commercial 
Fisheries Exploratory Fishing Base, Pascagoula, Mississippi, 
noted that faunal data resulting from their exploratory fishing 
operations necessitated development of computer methodology 
for handling the data. 

Those of us associated with the MNH project are not the only 
ones in the museum working with different applications of data 
processing. In the Department of Botany, Stanwyn Shetler is 
applying EDP to the broad Flora of North America Project, 
and Mason Hale is developing a type-catalog, and a list of gen- 
eral accession records and invoice data in Botany have been 
computerized for some time. James Peters, Department of 
Vertebrate Zoology, has pioneered within the museum in devel- 
oping computer programs to carry out time-consuming statisti- 
cal analyses commonly used in taxonomic studies. 

In a recent article on curation of invertebrate collections, 
Emerson and Ross (1965, p. 337) noted that: “The ideal method 
for cataloguing specimens and the retrieval of catalogue and 
specimen information would be a punch card or magnetic tape 
system. Vast amounts of information could be stored in a 
relatively small space and retrieved within seconds. Unfortu- 
nately, none of the museums in the United States has yet in- 
stalled such a system.” 



Background 

The Division of Crustacea was most fortunate to receive 
through the foresight and industry of members of its fore- 
runner, the Division of Marine Invertebrates, a remarkable 
file of specimen records in a 3 X 5 card format. This card file, 
consisting of about 125,000 entries, serves as a guide or index to 
the collection, a source of information on loans and holdings in 
general (management data), and a basic source of specimen- 
associated data. The existence of the record file, in a museum 
where specimen records, other than specimen labels and cata- 
log books, are in general absent, is a tribute to the persever- 
ance and foresight of such people as Mary Jane Rathbun, 
Waldo L. Schmitt, and Fenner A. Chace, Jr. The Marine In- 
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vertebrate catalog system has existed for approximately 70 
years. 

Our new system in Crustacea was designed as closely as 
possible to that already in existence in the Division. We wanted 
to show that we could continue the basic operations of docu- 
mentation and cataloging of the collection and prepare the 
data for computer storage at the same time. 

Briefly, let me summarize the pre-computer method of cata- 
loging. A eataloger would compile the data necessary for each 
entry, hand enter it into a ledger catalog, hand-write the label, 
type two copies of a specimen data card (one copy was to be 
filed in the species file, the other to be filed in a geographic 
file), and type a neck label for the jar. 

In 1965, we instituted a change in the cataloging procedure 
by installing a typewriter system in which a punched-paper 
tape could be generated during the initial typing of the data. 
The machine used is a CDC (SCM) Typetronic 2816 with two 
typewriter consoles, one featuring microelite type (with 16 
characters to the inch) and one featuring standard elite type 
( 12 characters to the inch). As the data is entered on the speci- 
men label on the microtypewriter, the machine generates a 
punched paper tape which can then be used to reproduce 
automatically on the other typewriter as many 3x5 cards as 
needed for the files. The jar neck label also can be typed from 
the tape. The system was developed with the expectation that 
some day the data on the paper tapes could be converted to 
magnetic tape, but it was installed almost two years before we 
received support for the MNH project. 

The cards, labels, and neck labels are printed in long per- 
forated strips which are much easier to feed into the typewriter. 
Pink cards and distinctly-marked neck labels are used for types. 
Originally, we planned on printing up three sets of 3 X 5 cards, 
one for the species file, one to be filed in numerical order as a 
replacement for the permanent ledger catalog, and one for the 
geographic file. Henry B. Roberts, Senior Museum Specialist in 
the Division of Crustacea, did most of the work involved in 
developing the new card format from the old one. 

At the time the program was started, Smith-Corona-Marchant 
(SCM) (now Control Data Corporation) was the only manu- 
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Table 1 . Data Organization, Division of Crustacea. 



Field 


Maximum 


Name 


Length 


Nomenclature type 


15 


spaces 


Catalog number 


8 


u 


Genus name 


21 


n 


Subgenus name, if used 


21 


m 


Species name 


21 


it 


Subspecies name, if used 


21 


it 


Author 


50 


n 


Total number of specimens 


5 


it 


Location I: Continent, country, ocean 


30 


ti 


Location II: State, province, island group 


30 


it 


Location III: County, parish, small island 


30 


ii 


Location IV: City, lake, miscellaneous 


70 


ii 


Latitude and Longitude 


48 


H 


Collection Gear 


20 


ii 


Depth 


20 


n 


Collector 


20 


it 


Collector’s number 


12 


ii 


Date of collection 


11 


n 


Identifier 


33 


n 


Date of identification 


11 


it 


Number and sex of specimens 


45 


ii 


Accession number 


10 


ii 


Type of entry (gift, etc.) 


13 


M 


Publication information 


45 


H 


Preservative 


3 


ii 


General remarks, and overflow from 180a 


45 


it 


General remarks 


45 


it 


General remarks 


45 


n 


General remarks 


30 


ii 


Data cataloged 


11 


it 



facturer of a system with a micro-elite typewriter. Since then, 
we have learned that SCM no longer will manufacture the 
Typetronic. Fortunately, perhaps, for those who require the 
micro-typewriter, Friden now manufactures one and can supply 
a system comparable to the Typetronic. 

A more detailed account of the development and use of the 
cataloging procedure was given by Squires ( 1966). 
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USNM - Department of Invertebrate Zoology PARA TYPE 

cat. no. 100932 * Lysiosquilla gray! Chace 

• • • • 2 specs. 7 

LOCAL. 

# # # # United States ; Massachusetts; Cape Cod;Bass 

,...y ver 

® • • • ' DEPTH 

COLL. BY Gray> M % B. 

DET - BY Chace, F. A. Jr. 

NO/SEX 2d - , 5? 

ACC. NO. 206768 ENTERED AS 

REMARKS 



intertidal 

18 Mar 1957 



• • • 
• • ® 
• • • 
• • • 
• • • 
• • • 
• • • 



ale 



Biological Bulletin, Woods Hole, vol. 114, 
no. 2, p. 141, pi. 1, figs. 1-5, 1958. 
Muddy sand at low water. 



172-REV. 9-20-67 



DATE CAT. 14 NOV 1957 



USNM. INVERTEBRATE ZOOLOGY PARriTYPE 

100932 9 Iysiosquilla grayi Chace 

5 specs. 7 

United States;Massachucetts^Cape Cod; 
Bass River/ 



depth intertidal 

Gray, M. B. 18 fikr 1,937 

Chace, F.A. ,Jr. 

Figure 1 . Catalog card and corresponding label used in Division of 
Crustacea, Smithsonian Institution. 



Types of Data Entered 

In our cataloging operation in Crustacea, the basic unit is a 
lot; each lot contains one or more specimens. Basic data for 
each lot are collected and verified by a cataloger who may re- 
ceive the lot with no more data than name, number and sex of 
specimens, identifier and date identified, station number and 
vessel, and accession number. These data are expanded by the 
cataloger to include as much of the information shown in Table 
1 as possible. 

Each of these types of information must be entered on the 
original 3x5 card (Figure 1) which we retain in the division 
file. In Crustacea we use data assigned to 30 different fields; 
often some of these items are left blank. In the experiment on 
birds some 39 fields are used and in minerals approximately 140 
fields have been identified. 
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LYSIOSQUILLA GRAYI 
CRUSTACEA 



WIP ACCESSION 



100432 



010A 
020A 
020A 
035A 
04 0A 
051A 
052A 
053A 
08 OA 
090A 
110A 
120A 
I30A 
140A 

150A 

160A 

170A 

180A 

190A 

200A 

230A 



NOMENCLATURE TYPE 
MUSEUM ABBREVIATION 
CATALOG NUMBER 
AUTHOR 

NUMBER OF SPECIMENS 

MAJOR LOCALITY 

SECONDARY LOCALITY 

SPECIFIC LOCALITY 

DEPTH 

COLLECTOR 

OATE OF COLLECTION 

IDENTIFIER 

IDENTIFICATION OATE 

NUMBER AND SEX 

ACCESSION NUMBER 
TYPE OF ENTRY 
PRESERVATIVE 
PUBLICATION INFO 
REMARKS I 
REMARKS II 
OATE CATALOGUED 



PARATYPE 

USNM 

0100932 

CHACE 

00007 

UNITED STATES 
MASSACHUSETTS 
CAPE COO/BASS RIVER 

00000 METERS, VARIANCE 0 METERS GIVEN AS INTERTIDAL 
GRAY, S. 

18 MAR 1957 
CHACE, F. A. JR. 



MALE MI M II M JV FEMALE F OV F JV JV LARVAE 
2 5 

USNM 206768 

ALC 

BIOLOGICAL BULLETIN, WOOOS HOLE, VOL. 114, 

NO. 2, P. 141, PL. 1, FIGS. 1-5, 1958. 

MUOOY SAND AT LOW WATER 
14 NOV 1957' 



Figure 2. Work-in-progress listing for same entry as shown in Figure 1. 



Our limiting factor here is perhaps the number of characters 
we can enter on a 3 X 5 card. The system we use has the capa- 
bility of storing some 4000 characters per catalog entry; we 
have used less than 800 in developing the data card in Crus- 
tacea. 

The first two lines include the basic information pertaining 
to that log, catalog number, name, and total number of speci- 
mens; these are the initial data used by those who work with 
the files and are coincidentally the basic invoice data. 

Data on the card down to and including the “Determined by” 
level on the card also appears on the specimen label. The re- 
mainder of the information appears in the ledger catalog and 
on the card, but not on the label. The label for this same lot is 
also shown in Figure 1. Card size was determined by the exist- 
ing files and label size was determined by the size of our basic 
specimen vial for smaller specimens. 

Upon completion of our cataloging process for a series of 
specimens, the tape generated by the Typetronic is forwarded to 
the Computer Center where the data are converted by the com- 
puter to magnetic tape. Then the data items are reshuffled by 
the computer to produce a Work-In-Progress Listing (WIP), a 
preliminary printout (Figure 2). We use this now for a second 
proofing of the original entry. The machine will automatically 
mark several kinds of errors, including erroneously marked 
fields, fields with no data, spelling errors in the data, data in 
wrong fields, etc. 
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Corrections may be made at any of the steps in the catalog- 
ing process. The punched paper tape can he corrected after 
the card and label have been proofread, and corrections of 
data on the WIP listings can be keypunched to update the data 
in the computer. 

We have the potential of retrieving data items by field or by 
any combination of fields. Further, the data can be rearranged 
by fields in any format which one might require. 

Note that depths are converted to meters; if a depth range is 
given in the original entry, the midpoint is the first depth given 
in the printout. In printouts, the midpoint (in meters), the 
range (in meters) and the original entry, as given in feet, 
meters, or fathoms, are all reproduced. All depth conversions 
are automatic. 

We have left several fields vacant in the section entitled 
REMARKS. These can be used for habitat information, for ref- 
erences to field notes or color photos, and so on. These fields 
are unrestricted at the present time. 

The basic card system in Crustacea is extremely adaptable. 
Although designed lor marine organisms, it has been modified 
with little effort to include crayfishes where locality data may 
be centered on drainage system and where information on as- 
sociated species, cross-referenced to field notes, is required. We 
are now adapting the format to free-living marine nematodes 
and to cephalopods. 

I want to discuss here in a little more detail two of the types 
of information entered and used, Nomenclature and Geography. 

Nomenclature 

The system is designed so that a taxon is a focal point for 
entry and retrieval of data. For each group a master taxa list 
must be compiled and entered. 

In preparing the master list for the stomatopods, I have in- 
cluded major synonyms. The entries for one species, Odonto- 
dactylus hrevirostris (Miers), are shown in Figure 3. 

Data on this species can be requested by using any of the 
synonyms or the senior synonym in the query. Few of our col- 
lections are up to date nomenclatorially for few of us have the 
help required to keep up with the name changes. By being able 
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000 0002 330 G ODONTOOACTYLUS 

000 0002 331 RG BIGELOW, 1893 



000 0002 340 B 
000 0002 350 RB 
000 0002 355 SB 
000 0002 356 RB 
000 0002 360 SB 
000 0002 361 RB 
000 0002 370 SB 
000 0002 371 RB 
000 0002 380 SB 
000 0002 381 RB 
000 0002 390 SB 
000 0002 391 RB 
000 0002 400 SB 
000 0002 401 RB 



ODONTODACTYLUS 
(MIERS, 1884) 
GONODACTYLUS 
MIERS, 1884 
GONODACTYLUS 
BIGELOW, 1393 
OOONTODACTYLUS 
FOCOCK, 1893 
ODONTODACTYLUS 
BORRADAILE , 1907 
OOONTODACTYLUS 
KEMP, 1911 
OOONTODACTYLUS 
CHACE, 1942 



BREVIROSTRIS 

BREVIROSTRIS 

HAVANENSIS 

HANSENII 

LATIROSTRIS 

SOUTHWELL I 

NIGRICAUDATUS 



000 0002 410 B 
000 0002 420 RB 
000 0002 421 SB 
000 0002 422 RB 



ODONTODACTYLUS 
(WHITE, 1850) 
GONODACTYLUS 
WHITE, 1350 



CULTRIFER 

CULTRIFER 



000 0002 430 B OOONTOOACTYLUS HAWAIIENSIS 

000 0002 440 RB MANNING, 1967 



GENUS 

REMARKS 



REMARKS 

SYNONYM 

REMARKS 

SYNONYM 

REMARKS 

SYNONYM 

REMARKS 

SYNONYM 

REMARKS 

SYNONYM 

REMARKS 

SYNONYM 

REMARKS 



REMARKS 

SYNONYM 

REMARKS 



REMARKS 



Figure 3. Synonymy of Odontodactijlus brevirostris (Miers) as stored 
in the computer Directory of Names. 



to identify and label synonyms, entries in the data bank under 
any of the names can be retrieved. 

A separate directory of names is maintained and, as catalog 
entries are added, the names are checked against the directory; 
entries accompanying names not in the directory are rejected as 
are entries under misspelled names. 

Neither the user nor the cataloger need be familiar with the 
numericlature used by the machine; only knowledge of the 
nomenclature is required. 

We have also developed a hierarchical classification, for we 
believed that data must be retrievable not only at the specific 
level but at any of several taxonomic levels. In Crustacea we 
have compiled a hierarchy down to suborder, to which we can 
eventually add families, genera, subgenera, and species. A 
portion of the crustacean hierarchy is shown in Figure 4. The 
numbers on the right are the numericlature; those on the left 
are part of a sequence of numbers required to enter the data 
originally and are not related to the numericlature. 

Our numerical code of 26 digits was developed to allow maxi- 
mum flexibility in adding to the hierarchy at any level and to 
maintain the specific name as the key to entry and retrieval of 
data. Neither the cataloger nor the scientist user needs to 
know the entire number sequence; it is internal in the com- 
puter. 
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SEPT 26, 1968 HIGHER TAXA NUMERICLATURE LISTING 

GROUP TYPE 

PAGE -LINE LVL TAXA NUMERICLATURE 



0000029125 


c 


LEPTOSTRACA 


1 


032 


008 


000 


0000 


0000 


000000 


00 


0000030130 


0 


NEBAL IACEA 


1 


032 


008 


001 


0000 


0000 


000000 


00 


0000031140 


G 


RHINOCARINA 


1 


032 


009 


000 


0000 


0000 


000000 


00 


0000032150 


C 


CE RAT I OCARINA 


1 


032 


010 


000 


0000 


0000 


000000 


00 


0000032160 


SC 


CERATOCARINA 


1 


032 


010 


000 


0000 


0000 


000000 


01 


0000033170 


C 


NAHECARIDA 


l 


032 


011 


000 


0000 


0000 


000000 


00 


0000034200 


C 


SYNCARIDA 


1 


032 


012 


000 


0000 


0000 


000000 


00 


0000034205 


D 


ANASPIDACEA 


1 


032 


012 


001 


0000 


0000 


000000 


00 


0000034206 


D 


BATHYNELLACEA 


1 


032 


012 


002 


0000 


0000 


000000 


00 


0000035209 


C 


PERACARIDA 


1 


032 


013 


000 


0000 


0000 


000000 


00 


0000036220 


D 


THERMOS BAENACE A 


1 


032 


013 


001 


0000 


0000 


000000 


00 


0000037230 


D 


SPELAEOGRIPHACEA 


1 


032 


013 


002 


0000 


0000 


000000 


00 


0000037235 


0 


MYS IDACEA 


1 


032 


013 


003 


0000 


0000 


000000 


00 


0000039250 


0 


CUMACEA 


1 


032 


013 


004 


0000 


0000 


000000 


00 


0000039260 


SO 


SYMPODA 


1 


032 


013 


004 


0000 


0000 


000000 


01 


0000040270 


0 


TANA IDACEA 


1 


032 


013 


005 


0000 


0000 


000000 


00 


0000040280 


SO 


CHELIFERA 


1 


032 


013 


005 


0000 


0000 


000000 


01 


0000040290 


SO 


ANISOPODA 


1 


032 


013 


005 


0000 


0000 


000000 


02 


0000041300 


0 


ISOPODA 


1 


032 


013 


006 


0000 


0000 


000000 


00 


0000042310 


0 


AMPHIPODA 


1 


032 


013 


007 


0000 


0000 


000000 


00 


0000042320 


SO 


LAEMODIPODA 


1 


032 


013 


007 


0000 


0000 


000000 


01 


0000042325 


C 


EUCARIDA 


1 


032 


014 


000 


0000 


0000 


000000 


00 


0000043330 


0 


EUPHAUS IACEA 


1 


032 


014 


001 


0000 


0000 


000000 


00 


0000044340 


0 


PYGOCEP HALOMORPHA 


1 


032 


014 


002 


0000 


0000 


000000 


00 


0000045350 


0 


DECAPODA 


1 


032 


014 


003 


0000 


0000 


000000 


00 


0000046358 


C 


HOPLOCARIDA 


1 


032 


015 


000 


0000 


0000 


000000 


00 


Figure 4. 


A portion of the crustacean classification used 


in the MNH 



project. 

The hierarchy, to order, was arbitrarily selected and entered 
into the computer. Aspects of classification can be updated at 
any time and categories below order can be entered at any 
time. The classification of invertebrates proposed by Black- 
welder (1963) was used as the basis for the hierarchy. 

Assume that we have completed sections in Crustacea and 
now plan to enter any other invertebrate group. The hierarchy 
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will accommodate 999 families under any order, and, further, 
up to 9 additional families can be entered at some future time 
between any 2 of the original entries, [without changing the 
numericlature assigned to the original families entered!. Simi- 
larly, up to 99 genera can be assigned to each family and 9 gen- 
era can be added between any of two of the original genera. 
The number of species that can be entered originally in each 
genus is 9999, with room to add 99 between any 2 of the orig- 
inal entries. 

Data can be entered by genus or higher category only and 
also be retrieved by those categories. Data associated with 
such designations as variety, forma, species near xus, new spe- 
cies, etc., can also be entered and retrieved. The system is 
flexible enough to handle subgenera and subspecies as well. 

Geography 

Emerson and Ross (1965) and Levi (1966) have commented 
on the importance of locality data in collections of inverte- 
brates, and other authors have cited one or more methods of re- 
cording geographic data, including distance and direction from 
a known point to a locality ( 10 mi. N., 4 mi. E ) ( Riemer, 1954; 
Hutchison, 1964), use of township, section, and range (Ax- 
tell, 1965), legal description (Wheeler, 1965), and so on. In 
studies on marine animals, latitude and longitude (Axtell, 
1965; Steward, 1965) or Marsden Square are commonly used. 
All of these methods have specific applications; none are used 
exclusively by all taxonomists. 

For these reasons we have had to develop our own geo- 
graphical code, called the Global Reference Code, designed by 
Reginald Creighton, Anthony Piacesci and Dick King of the 
Smithsonian Information Systems Center when it became ap- 
parent that existing methods of storing retrieving geographic 
data had too many limitations. Creighton and his colleagues 
are preparing a paper on development of the Code. 

The Global Reference Code is a hierarchical system in that 
several levels, from the general, such as Pacific Ocean, to spe- 
cific, such as Manila Bay are used, with room for four levels of 
complexity. Locality data may be entered by latitude and 
longitude or by name. 
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Latitude and longitude are assigned to specific localities by 
hand, and a separate geographic data bank is maintained by the 
computer. 

The smallest area defined by the GRC is a two-minute square. 
All data referring to localities within each two-minute square 
can be retrieved and printed out as originally entered. 

The following example will demonstrate the flexibility of the 
system. Soldier Key is a small island on the eastern edge of 
southern Biscavne Bay, Dade County, Miami, Florida. It may 
be identified, in different collections within the museum, as 
(a) Soldier Key, (b) a small isolated key 10 miles south of Key 
Biscavne, Miami, (c) a key north of Elliot Key, Biscayne Bay, 
or (d) it may have been identified originally by its coordinates 
or by compass bearing and range from a point. If information 
on species found at Soldier Key is required, the computer will 
select and print out all of the data given above, as originally 
entered. 

This has numerous benefits, for we can retrieve data by lati- 
tude and longitude, ocean, county, state, drainage system, 
Marsden Square, or by any of existing methods of recording 
locality data. 

Eventually, as the geographic data bank develops, routines 
can be developed so that it can be searched for specific geo- 
graphic data and reenter these into the catalog record auto- 
matically. 



Results 

Initially there was a serious lag between development of 
computer programs for data storage and manipulation, as well as 
development of formats for geography, so that by the time the 
programs were ready a large series of records had accumulated 
awaiting entry into the system. We then learned that all of our 
entries accumulated in this period had to be redone for a va- 
riety of reasons, primarily because the fields for each entry had 
not been identified correctly. The beginning and end of each 
field must be flagged or marked by the typewriter operator and 
the flag, an asterisk or an exclamation point, must appear on the 
tape, or, as far as the computer is concerned, the fields cannot 
be identified and the record is invalid. 
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Although we believed we had designed the system so that we 
could eventually phase out the ledger catalog, i.c., it would be 
replaced by a card file in numerical order, we quickly learned 
that a ledger or equivalent, in the form of a work sheet, was 
required in order to prepare and arrange the data. We still 
use the ledger and we have dropped, at least temporarily, the 
numerical card file. 

Now that the geographic data bank and the Global Refer- 
ence Code has been developed and is functional, we should be 
able to replace the geographic card in our cataloging operation. 

I have noted above that we learned that verification and as- 
sembly of data by a cataloger, in preparation for the actual 
typing operation, required the use of a ledger catalog or data 
entry sheet of some sort. From an operational point of view it 
was much simpler for the trained cataloger, a data specialist, 
to work with the specimens in an area not necessarily adjacent 
to the typewriter. Alcoholic collections are too messy for the 
basic work to be done at or near the typewriter. Initially, we 
used the cataloger not only as a data specialist but also as a 
typewriter operator. We have learned from our first year of op- 
erations that the cataloger can be reserved for the data-verifica- 
tion aspects of the operation, and that the data can be entered 
by a clerk-typist. Further, a clerk-typist can enter data far 
faster than any one cataloger can prepare it. 

As we recataloged the stomatopods, data for the types was 
entered along with data for all of the other specimens in the 
collection. In working with the preliminary printout, the Work- 
In-Progress listing, of the data from the types, it became ap- 
parent that it would be relatively simple to reformat the data 
and generate a type-catalog by the computer. 

A program has been developed and the printout has been re- 
quested. We expect to obtain in one printout an alphabetized 
list of the types, as cataloged under their original names, along 
with a list of the current names for those that have changed 
since the original description. 

Another finding is that our data input is not completely satis- 
factory, for we need the capability of including more habitat 
data. We had reserved space under “Remarks” for habitat data, 
reference to field records, color notes, and so on. In the case of 
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many of our specimens, there is relatively little information 
available other than locality, date, and collector. Now the 
number of records with habitat data is so slight that our in- 
quiries can be w orded so that only records with information in 
the “Remarks” section are printed. We wall be working on this. 

On the cards themselves, we have reserved space at the bot- 
tom to edge-punch the cards with the basic data for invoices; 
catalog number, name, and total number of specimens. If the 
cards were punched as they were processed, we could even- 
tually be in a position to generate invoices of loans from our 
cataloged collection by stacking cards, in the order desired, in 
the Typetronic reader and let the machine automatically type 
out the basic data. Unfortunately, edge punching obliterates 
any entries in the “Remarks” section so we have not imple- 
mented this as yet. 

Possible Experimental Programs 

During the second part of the project we plan to begin manip- 
ulation of the data base built up during the first 18 months of 
operation. Interrogations designed to test the capabilities of 
the system and to provide information as retrieval costs might 
include the following: 

(1) List and count all species in genus represented in collec- 
tion. 

(2) Determine number of species in given collection and state 
source. 

(3) List all materials identified from accession Z. 

(4) What species of Family X occur together in depth range 
X-Y in the northwestern Atlantic. Plot distribution pattern 
of each species. 

(5) What species in genus X are not represented in the col- 
lections. 

(6) List by accession number material of family Y from the 
eastern Pacific not yet identified to species. 

( 7 ) List species occurring at island X. 

(8) List species collected by ALBATROSS at Sta. X. 

(9) List type-species of family X or order Y not represented by 
materials in collection. 
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In addition to working out various queries and testing the 
system, I would like to work on several items. I have already 
noted that in some cases adequate data, available from field 
notes, etc., cannot be entered on the 3X5 specimen card. I 
would like to develop a station data Directory, similar to our 
geographic Directory, in which all data pertaining to a partic- 
ular station or collection could be entered and also be availa- 
ble for combination with the basic catalog data by the com- 
puter. 

For example, Dr. Waldo L. Schmitt kept extensive field notes 
which relate directly to many specimens now in the crustacean 
collection. If we could enter all of this by station number and 
have the computer add this data to each printout record, we 
could greatly increase the amount of information available on 
each specimen. For the purposes of management of our collec- 
tion, this information is not neccessarily needed in our species 
file. Its value for research purposes is obvious. 

Similarly, data from Smithsonian expeditions and other ex- 
peditions as well might be stored and tied to the specimen rec- 
ord by the station number. 

We also plan to develop a basic catalog card for parasites 
and commensals from our crustacean holdings which w ill 
emphasize host and ecological data. 

During the coming year we plan to investigate various ap- 
plications of the computer and the computer-based specimen 
data bank to our routine operations. The possibilities are un- 
limited. We should be able to generate invoices giving com- 
plete specimen data, lists of holdings for visitors, lists of hold- 
ings for exchange purposes, as well as a machine-generated 
catalog in tabular form which could be bound and retained as 
a permanent record. Entry of present unidentified material 
(identified to family or genus level, but not species level), would 
enhance our abilities to make our collections and these data 
available to visitors. 

Development of off-campus storage facilities for inactive 
collections, in my opinion, is dependent on development of 
banks of data associated with those collections; removal of col- 
lections to off-campus facilities without documentation of the 
material would be tantamount to destruction of the material. 
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I would also like to see the museum develop methods for 
computer storage of records from the literature as well as data 

associated with specimens in our collection. 
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